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Abstract 

We introduce the notion of weight for the lattice dimension and the notion of 
topological dimension - hole dimension. The condensate in Bose-holes exists in the 
case when temperature is not low. 

In our previous papers we applied the quantum statistics approach to linguistic statis- 
tics p. 

To each word a frequency dictionary assigns its number of occurrences in the corre- 
sponding corpus of texts. There may be several words with the same number of occur- 
rences. 

The rank of a word (i.e., the order number of a word in a frequency dictionary) is 
measured from the word A\ with the highest number of occurrences s in a given corpus 
of texts. The number of all words with the same number of occurrences % we designate 
as iVj. Then, if A m is the last word in a list of words with number of occurrences m (the 
words of the same number of occurrences can be arranged in an arbitrary way), then the 
rank r m of A m is obviously 

s 

r m = J2 Ni - 

i=m 

There is an analogy between the Bose particles at the energy level of an oscillator \ = i 
and the words with the occurrence number i, namely, the words with the same occurrence 
number can be ordered in an arbitrary way, say, alphabetically, inversely alphabetically, 
or in any other order. The indexing of the ranks (the indices) of words within the family 
of a given occurrence number is arbitrary. In this sense, the words are indistinguishable 
and are distributed according to the Bose statistic. 

However, there is a difference between the approaches under consideration. In the 
frequency dictionary one evaluates the number of occurrences of every word and then 
orders the words, beginning with the most frequently occurring words. 

When there were no computers, it was difficult for a person to evaluate the number 
of words with equal occurrence number. By looking at a page as if it were a picture, a 
person can determine a desired word on this page by its graphical form at every place of 
occurrence of the word. In this case, the person looks at a page of the text as if it were a 
photo, without going into the meaning. Similarly, if a person looks for a definite name in 
a long list of intrants who had entered a college, this person finds (or does not find) the 
desired name by eyes rather than reads all the names one after another. 

An eye gets into the way of recognizing the desired image, and this ability intensifies 
as the viewed material increases: the more pages the eye scans, the less is the difficulty 
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in finding the desired graphical form. Therefore, under a manual counting, it was simpler 
to recognize the desired word on a page without reading the text and to cross it out by a 
pencil, simultaneously counting the number of occurrences of the word. This procedure 
is repeated for any subsequent word, already using the text with the words crossed out 
("holes"), which facilitates the search. In other words, the procedure is in the recognition 
of the image of the given word, similar to the recognition, say, of a desired mushroom 
in a forest without sorting out all the plants on the soil one after another. An ordinary 
computer solves problems of this kind by exhaustion, whereas a quantum computer (see [21 
3]) makes this by recognizing the image. 

However, for an ordinary computer, the number of operations needed to find the 
occurrence frequency of a word is less than the number of operations needed to find the 
number of words in the text with a given occurrence frequency. 

One can say that the number of mushrooms we gathered (took away from the forest) is 
the number of holes we left in the forest. Similarly, the words we had "got out" from the 
text in the above way is an analog of holes rather than particles. Therefore, the linguists 
count the rank of words starting from the opposite end as compared to the starting end 
which would be used by physicists. The physicists would count the particles starting 
from the lowest level, whereas the holes, the absent electrons, would be counted from the 
highest level. 

For this reason, the words in a frequency dictionary are associated with holes rather 
than particles. Correspondingly, the dimension in the distribution of frequency dictionar- 
ies is to be chosen as a "hole" dimension ("Dirac's hole"), which is negative. 

We will prove a cumulative formula in which the densities coincide in shape with the 
Bose-Einstein distribution. The difference consists only in that, instead of the set X n of 
random variables or eigenvalues of the Hamiltonian operator, the Bose-Einstein formula 
contains some of their averages over the cells [lj. In view of the theorem given below, one 
can proof that the Si, which are averages of the energy A& at the ith cell, are nonlinear 
averages in the sense of Kolmogorov [5] . 

As in [6J, the values of the random variable Ai, . . . , X s are ordered in absolute value. 
Some of the numbers Ai, . . . , X s may coincide. Then these numbers are combined adding 
the corresponding "probabilities", i.e., the ratio of the number of "hits" at Aj to the 
general number of trials. The number of equal Aj : Aj = A i+1 = ■ ■ ■ = X i+k will be called 
the multiplicity of the value Aj. In our consideration, both the number of trials N and s 
tend to infinity. 

Let Ni be the number of "appearances" of the value Aj : Aj < A i+ i, then 

1=1 

where M is the mathematical expectation. 

The cumulative probability Vk is the sum of the first k probabilities in the sequence 
A 4 ; V k = ± Yh=i N ii where k < s. We denote NP k = B k . 

If all the variants for which 

s 

X>i = # (2) 

8=1 

and 

s 

NiXi < E, E = MN < NX, (3) 

i=i 



where A = Y ' U Q qiX \ Q = ££= zl qi, are equivalent (equiprobable) , then [5J El [S] the ma- 
jority of the variants will accumulate near the following dependence of the "cumulative 
probability" B,{iVi} = N,, 



i=l i=l 



where and z/ are determined by the conditions 

B, = N, (5) 

8=1 

as iV — > 00 and s — > oo. 

We introduce the notation: .M is the set of all sets {iVj} satisfying conditions (j2J) 
and (j3J); 7V{.M} is the number of elements of the set A4. 

Theorem 1 Suppose that all the variants of sets {iVj} satisfying the conditions (Tj|p 
and (TJ|) are equiprobable. Then the number of variants Af of sets {iVj} satisfying condi- 
tions (TJ|) and (TJj] and the additional relation 

8=1 1 

is less than ClJ ^^ (where C\ and m are any arbitrary numbers, ^2\ =1 q% > sQ, and e is 
arbitrarily small). 

Proof of Theorem 1. 

Let A be a subset of M. satisfying the condition 

s s 



y Nl - y * — i<a 



i=l+l i=l+l 



I 



y N .-y * I < 



A. 



8=1 8=1 

where A, (3, v are some real numbers independent of I. 
We denote 



t=i+l i=Z+l 
1=1 8=1 

Obviously, if {iVj} is the set of all sets of integers on the whole, then 

s 

N{M \A} = Y^(e(E-J2 AiAi)*EU *)j&{Si ~ A)6(5^ - A)) , (8) 



{iVi} i=l 
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where Ni are arbitrary integers. 

Here the sum is taken over all integers Ni, 0(A) is the Heaviside function, and Sk u k 2 
is the Kronecker symbol. 

We use the integral representations 

W = V- / d<pe- iN ^ N 'e iN '^, (9) 

^ J-n 

1 r°° i 

e(y) = 7T-. / d\—e^ 1+a l (10) 

Now we perform the standard regularization. We replace the first Heaviside function 
in (jSJ) by the continuous function 



9a(y) = 



for a > 1, y < 

1 _ epvii-a) f or a > 1, y > 0, 



e /3y(l-a) for a < Qj y < 

1 for a < 0, ?/ > 0, 
where a G (— oo, 0) U (1, oo) is a parameter, and obtain 



1 f°° 1 1 
©«(y) = ^ / ^ 1+ ^( r ; )dx. (11) 

If a > 1, then G(y) < Q a (y). 

Let v < 0. We substitute (JSJ and ffTUj) into (jSJ), interchange the integration and 
summation, then pass to the limit as a — > oo and obtain the estimate 



■A/"{A< \ .A} < 



< 



e -vN+/3E pn 



/7T J J 

[exp(-ii\fy>) ^ exp{-/? ^ iV.A, + {icp + v) ^ iV,}] ^ x 

{AT,} 3=1 3=1 



Z(27T) 

xe($-A)e(S s _,-A) , (12) 



where /3 and z/ are real parameters such that the series converges for them. 

To estimate the expression in the right-hand side, we bring the absolute value sign 
inside the integral sign and then inside the sum sign, integrate over <p, and obtain 

M{M \ A} < ^ Yl ex P{"^ 2 NiXi + v 2 Ni * x 

{TV;} i=l i=l 

xe(s , ,-A)e(s , i _ l - A). (13) 

We denote 

Z(/3,iV) = ^e- /3 ^^ A % (14) 
where the sum is taken over all N such that Yli=i N = N, 

l s 
i=l i=l+l 
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It follows from the inequality for the hyperbolic cosine cosh(x) = (e x + e~ x )/2 for 
> 5; |a;2 1 > <5 ; 

cosh(xi) cosh(x 2 ) = cosh(xi + 22) + cosh(xi — #2) > — (15) 

2 

that the inequality 

1 s 

Q(S 8 -i - A)Q(Si - A) < e~ cA cosh (cJ2 N i ~ cosh(c JV< - c$ a _A (16) 

i=l i=Z+l 

where 



% - ft 



8=1 t=i+l 

holds for all positive c and A. 
We obtain 

A/"{M \A}< e~ cA exp - uN) x 

i j / 1 

x exp{-/3 Ni\i + v ^ N i} cosn ( ^2 cNi ~ ^ ) x 

{Ni} i=l i=l \i=l 

s s s 

x exp{— (3 ^2 + v ^2 Ni} cosh^ ^ ciVj — c<; 

i=i+l i=J+l i=l+l 

= e^e" cA x 

x (d(v - c, /?) exp(-c^) +Ci{v + c, /3) exp(c^)) x 

x (C—iiv ~ c, /3) exp(-c0 s _J + ( s _t(v + c, /3) exp(c0 s _J) . (17) 

Now we use the relations 

d d 

■7^ h\Ci\i3=p',v=v' = 4>u 0^1nCs-*|/3=/3V=i/ = <P s -i (18) 
and the expansion £j(z/ ± c, 0) by the Taylor formula. There exists a 7 < 1 such that 

\n(0(v ± c, ?)) = ln0(^/3) ±c(ln0)>,/3) + y(lnG)>±7c,/?). 

We substitute this expansion, use formula (flBl . and see that is cancelled. 
Another representation of the Taylor formula implies 

In (C { (f + c, (3)) = In (0(/3, 1/)) + ~ In *)) + 

rv+c/P o2 

+ y ^ + c//3-z/) — ln(CK/3y))- (19) 

A similar expression holds for £ s _j. 

From the explicit form of the function G(/3, u), we obtain 



92 1 /w/a ^ «2r ^exp(-/3(Ai + 1/)) 2 
— In (G(/3, 1/)) = /3 g (exp( _ /?(At + y)) _ 1)2 ^ Z 3 



(20) 



where d is given by the formula 

exp(-/3(Ai + i/)) 



d 



(exp(-/3(A 1 + u)) - If 



The same estimate holds for C,s-i- 

Taking into account the fact that CiCs-i — C«> we obtain the following estimate for 
P = p' and v = v': 

r 2 

M{M \A}< Cs(P', v) exp(-cA + -f3 2 Qd) exp(£/?' - v'N). (21) 
Now we express ( s (i>' ', f3') in terms Z(/3, N). To do this, we prove the following lemma. 
Lemma 1 Under the above assumptions, the asymptotics of the integral 

-uN yir 



has the form 



Z(/3, N) = — dae-^XsW, u + ia) (22) 

Z7T 



-»« CO?,") „ , „,i 



^^'^ ll^wi/WI ^V' (23) 

where C is a constant. 

The proof of the lemma readily follows from the saddle-point method and the inequal- 
ities 

\£i(u + ia,P)\ <£i(y,p), \( s (v + ia,P)\<Cs(vP), (24) 

which hold, because e u ~@ Xi < 1 for all a ^ 27m, where n is an integer. It follows from 
these inequalities that a = is a saddle point of integral (122]) [9J [TO] . 

Lemma 2 TTie quantity 

where ^ Ni = N and AjiVj < E — N l l 2+£ , tends to zero faster than N~ k for any k, e > 0. 

We consider the point of minimum in P of the right-hand side of (|17|) with v(P, N) 
satisfying the condition 

It is easy to see that it satisfies condition (jSJ). Now we assume that the assumption of the 
lemma is not satisfied. 

Then for J2 N i = N ,H W >E- N 1/2+£ , we have 

e PE ^ e-P'ELi^ > e (A rl/2 +^)/3_ 

{Ni} 

Obviously, P <C -7= provides a minimum of (JTTj) if the assumptions of Lemma 1 are 
satisfied, which contradicts the assumption that the minimum in P of the right-hand side 
of (JTTj) is equal to P'. 



We set c = N ^ +a in formula fl2T|) after the substitution fl23]) ; then it is easy to see that 
the ratio 

N\M \ A) _ 1 
Af(M) ~ N m ' 

where m is an arbitrary integer, holds for A = jV 3 / 4+e . The proof of the theorem is 
complete. 

Now we consider the notion of the lattice dimension. 

We consider a straight line, a plane, and a three-dimensional space. We separate 
points i = 0, 1, 2, . . . on the line and points x = i = 0, 1, 2, . . . , y = j = 0,1,2, .. . on the 
coordinate axes x, y on the plane. We associate this set of points with the points on 
the straight line (with the positive integers 1 = 1,2...) up to the quantum constant x °f 
the lattice. 

According to M. Gromov's definition [11], the asymptotic (topological) dimension of 
this lattice is equal to two. 

We associate each point with a pair of points i and j according to the rule i+j = I. The 
number of such points rii is equal to / + 1. In addition, we assume that z = k = 0,1,2, .. . 
on the axis, i.e., we set i + j + k = I. In this case, the number of points qi is equal to 

(/ + l)(/ + 2) 
qi = 2 ' 

If we set Aj = I in formula (jl]), then, in the three-dimensional case, each i is associ- 
ated with ^ +1 )(*+ 2 ) f mutually equal xi = I (these are the multiplicities or the qi— hold 
degeneracies of the spectrum of the oscillator). Formula (jl]) in this special case becomes 

ff.^fMM, (26) 



i=0 



for large i, ^ —> 



AN^caast ^Jl^^ A,, A t = 1, (27) 

AEi = const ff + ^ft + ^ A, (28) 
2(eP*-" - 1) K J 



uj 3 du h , . 

dE = const-, ; P = - 29 



(cf. formula (60.4) in [12]). 

Thus, we obtain a somewhat sharper version of the famous Planck formula for the 
radiation of a black body. 

For the D-dimensional case, it is easy to verify that the sequence of weights (multi- 
plicities) of the number of versions i = Y2k=i m ki where rrik are arbitrary positive integers, 
has the form of the binomial coefficient 

Qi(D) = const y - T - r A 30 

%\D\ 

where the constant depends on D. 

Thus, for any D, formula (jl]) has the form 

i=l 
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For the positive integers, we have a sequence of weights (or, simply, a weight) of the 
form (pjl. 

Our weight series can easily be continued to an arbitrary case by replacing the factorials 
with the T-functions; in this case, we assume that D is negative. 

This is the negative topological dimension (the hole dimension) of the quantized space 
(lattice). 

If D > 1, then, as i ^ oo, a condensation of a sufficiently small perturbation occurs 
in the spectrum of the oscillator and the multiplicities split, i.e., the spectrum becomes 
denser as i increases. The fact that D is negative means that there is strong rarefaction 
in the spectrum as i — > oo (the constant in formula (13TT) must be sufficiently large). 

For non-positive integer D, the terms i = 0, 1, 2, 3, . . . , — D become infinite. This 
means that they are very large in the experiment, which permits determining the lattice 
negative dimension corresponding to a given problem. We note that a new condensate 
occurs, which is possible for small f3. 

Now we return to frequency dictionaries. A frequency dictionary is composed using 
texts from a certain array. Each word in the dictionary is associated with the number of 
its encounters in the original array of texts. We denote this number by uji and the number 
of words corresponding to this number of encounters by iVj. The sum of all Ni is equal to 
the volume of the entire dictionary, and the sum of products o^n, is equal to the volume 
of the entire array of texts used to compose the dictionary. 

The number of words encountered only once in the array of texts is approximately 
equal to 1/3 of the entire frequency dictionary which the number of words equal to N. 
So as iV — > oo this is the condensate. It follows from the above that D — — 1 for the 
dictionary. Hence, for f3 -C 1 and v ~ 1, we have 

1 r duo 
Ni = const y — — r- r ~ const / ; — — -, (32) 

where u = I and a is the scale constant. If uj is finite and (3 <C 1 the integral may be 
taken. 

For the frequency of "Japanese candles" of 30 stocks in the stock market (see [13]). 
the dimension is equal to 0. 

Figures 1 and 2 show the rank-frequency curves for Leo Tolstoy's War and Peace. In 
Fig. 1, the low-frequency part of the dictionary is approximated by formula (1321 as (3 = 
(words with frequencies l 3; r = 43408.8, a = 0.891995, and c x = -189.321). For the 
entire dictionary, Fig. 2 shows the deviation of the theoretical data from the dictionary 
data (the difference between the frequency given in the dictionary and the frequency given 
by the formula) against rank. The deviation is seen not to exceed 1.5 words if the rank is 
larger than 300. 

Examples of holes. 

In the Frenkel' theory of crystals, a hole (Dirac's hole) is an absent electron, and it 
behaves as a particle moving along the crystal. However, in contrast to an electron, this 
particle cannot exist outside the crystal; there is no hole in vacuum. In the same way, a 
hole, as we understand it, cannot exist without a sufficiently large scale of spaces in which 
it is "made." 

In fact, the fractal dimension of a hole, a crack in a rock, etc., depends on the scale at 
which "our eye" distinguishes points. For instance, the coastline is of fractal nature when 
observed from a plane. However, if we stand at the very coast, then this line is smooth, 
and its dimension is equal to that of a smooth curve, i.e., to one. 
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Figure 1: War and Peace: the variable r is the word's number in the frequency dictionary in 
ascending order of frequencies; u is the word frequency 

When considering cracks in metal, the dimension depends on the resolving capacity 
of the device used to observe the crack [HI [T51 US] . Denote the resolving capacity of the 
device by k. The usual definition of the Hausdorff dimension is based on the condition 
that some balls (as a rule, three-dimensional in practice) vanish. This means that the 
volume Q of the metal under consideration must be much greater than k 3 . Thus, reducing 
the problem to unit volume, we can see that the radius of the Hausdorff balfj] "tends to 
zero" as Q —>■ oo. 

The hole-wave propagation is especially visual when considering elastic waves in media 
in which the Young modulus is sufficiently large with respect to compression and vanishes 
with respect to extension. In particular, sand has this property. The waves-holes were 
studied in detail by the author together with P. P. Mosolov in the case of one spatial 
coordinate ([18J, Appendix). 

A worm corrodes a complicated serpiginous path-hole in the stem of a tree. This path 
is very small with respect to the stem, and we can assume that the thickness of the stem 
is infinitely large as compared with the hole. 

The dimension of a wire of the same thickness and the same "sinuosity" can be defined 
independently of the surrounding medium, whereas one cannot consider the hole (made 
by the worm) beyond the stem, although both the dimensions have the same absolute 
value. 

Let us now present an economical example. 

1 That is, of the ball used in the definition of the Hausdorff dimension, see |17j . 




Figure 2: The deviation of the theoretical data from the dictionary data for the first volume of 
War and Peace 

Assume that somebody has inherited a great legacy in diverse forms and frivols it away 
"to the left and to the right." If it is complicated to evaluate the amount of legacy, then 
the amount of expenses increasing in time t according to the power law t k (the appetite of 
expenses can increase, as well as the appetite of profits!), then k is the negative dimension, 
or k is the dimension of the "hole" thus occurring. 

Let us now consider the simplest examples of measures in the general case for the 
n-dimensional space. Let S n be an n-dimensional ball of radius r. In the spherical 
coordinates, the volume /i(S n ) of the ball is equal to const f Q r n ~ l dr = const r n . Here 
r n_1 stands for the density. 

In the sense of the Fourier transform, the multiplication by a coordinate is dual to the 
corresponding derivation. Therefore, we can speak of dual n times differentiable functions 
in the Sobolev space W£- Dirac distinguished between the left and right components "bra" 
and "ket" in the "bracket" inner product. The "dual" space of this space according to 
Dirac is the space W^ n of Sobolev distributions (generalized functions). 

In the same way we can define the functions in by the "inner product," where 
s is a positive noninteger number, and the space W^ 1 * as the "inner product" conjugate 
to Wl 

One can similarly proceed with the density (or the weight) r s and r~ s , by using, for 
instance, the Riesz kernel to represent functions in Wif . 

Let us present an example of a space (of noninteger positive dimension) equipped with 
the Haar measure r CT , where < a < 1. 

On the closed interval < x < 1 there is a scale < o < 1 of Cantor dust with 
the Haar measure equal to x a for any interval (0, x) similar to the entire given set of the 
Cantor dust. The direct product of this scale by the Euclidean cube of dimension k — 1 
gives the entire scale k + a, where k G Z and a e (0, 1). We consider the space of negative 



in 




Figure 3: The Japanese candle order number vs. frequency for 30 stocks: R is the candle order 
number, starting from the smallest candle; u is the candle frequency. 



dimension —D = —k — a with respect to this very space. "Quantizing" of the densities 
considered above leads to the density of the form T(D + I)/ (T(D + + 1)). 

The following problem arises further: How one can understand a scale of spaces in 
specific problems on a crack whose dimension t increases as t — > oo? The size (volume) 
of the metal or rock is not a dimension because all these objects are three-dimensional. 
Where a sufficiently large value of dimension arises? 

For comparison, consider a gas in a three-dimensional vessel. Every molecule of gas 
moves. The number of molecules is ~ 10 23 . If a motion of every molecule is considered, 
then, as a rule, one assumes that the entire dimension is equal to 10 69 . 

On the other hand, one can consider all molecules as points of three-dimensional space. 
If we also take into account the displacement at time t, then we can consider the points 
in the four-dimensional space as well. This very duality enabled us (together with my co- 
author P. P. Mosolov) to pass from the Newton equations for every molecule to equations 
of field-like hydrodynamics of the entire gas P22 

If we choose two moments of time for the displacement of the molecules of gas in 
the 10 69 -dimensional space, i.e., consider the displacement of all 10 23 molecules as a single 
point at the first moment and as another point at the other moment, then we can introduce 
the natural distance between these points, i.e., a metric. 

One can construct a scale of metrics between this metric and the metric in three- 
dimensional space by combining diverse groups of particles. 

From the viewpoint of mechanics (elasticity theory) and the scales typical for this 
science, metals, as well as rocks, consist of "grains" of sufficiently small (rather than 
atomic) measure-volume. These grains can be of different size, and we can regard the set 
of these grains as a point in the space of sufficiently large dimension [2TI [22] . This very 
interpretation enables us to pass to the general definition of a fractal crack as a hole and 
present the abstract mathematical definition. 
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General definition of spaces of negative negative dimension. Let M to be a compactum, 
of Hausdorff dimension to, which is an element of a t-parameter scale of mutually embed- 
ded compacta, < t < oo. Two scales of this kind are said to be equivalent with respect 
to the compactum M to if all compacta in these scales coincide for any t > t$. We say that 
the compactum M to is a hole in this equivalent set of scales and the number — 1 is the 
negative dimension of this equivalence class. 
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